![]() Resource exploitation management system, method and program product
专利摘要:
A resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database. 公开号:EP3683759A1 申请号:EP20152482.4 申请日:2020-01-17 公开日:2020-07-22 发明作者:Sonia Mariette EMBID DROZ;Cristina IBÁÑEZ-LLANO;Giorgio De Paola;Rubén RODRÍGUEZ TORRADO;Akiko Suzuki;Mustafa Canim;Yuan-Chi Chang;Robert Farrell;Sharon M. Trewin 申请人:Repsol SA;International Business Machines Corp; IPC主号:G06Q50-00
专利说明:
[0001] The present invention is related to the estimation of hydrocarbon reservoirs and more particularly to automatically consolidating geological information and knowledge, extracted from natural language text and used for estimating hydrocarbon reservoirs. Background Description [0002] Each new hydrocarbon reservoir has an inherent total value that is based on reservoir properties. The inherent value depends on the total amount of material that is ultimately recoverable from the reservoir (production potential) offset by the cost of recovering the material or capture difficulty. One or more experts estimate that value by identifying and selecting existing reservoirs based on geological knowledge from existing reservoirs. Using that knowledge, often from multiple information sources, including from unstructured document corpus for example, the experts identify those existing reservoirs (known as "analogous reservoirs") with certain aspects similar to the new reservoir. [0003] However, especially when extracting information from multiple geological knowledge sources, the nature of extraction produces results that vary in degree of trust, confidence and accuracy. Further, consolidating the evidence extracted from both unstructured and structured data sources using imperfect information extraction analytics provides fundamental challenges. Moreover, the petroleum geology domain is subject to constant discovery both physically and technologically. New sensor technologies make data collection more common and more precise. Simulations techniques better model the underground geological structure. Each discovery may introduce new evidence or conjecture to the current corpus and improve understanding of the structure of a new or existing reservoir. [0004] State of the art, knowledge extraction techniques are lossy, and applied to the raw data frequently provide conflicting results and/or contradicting statements. For example, descriptive sentences may contain co-references between nouns and pronouns that may not always resolve correctly in a straightforward way. Some sources may include point-in-time or out of date domain understandings. Also, conflicts may result from disagreement among knowledge corpus and data source creators or between experts. These conflicts may arise, especially with regard to previously unexplored geological regions where validated data is scarce. These conflicts force the reliance on experienced geologists for resolution. Moreover, each introduction may rewrite the previous version of a resource description. Even without a new discovery, if the current knowledge were completely accurate, adding new information to domain knowledge advances knowledge evolution naturally that may introduce inconsistent and/or incomplete knowledge assertions over time. Also, even without contradicting evidence experts may make different assumptions that leads to different conflicting conclusions, that results in inaccurate attribute and property associations. These variations and inaccuracies can cause selecting the wrong reservoir for a mis-valuation and wasted resources, e.g., from passing on an undervalued reservoir to exploit an overvalued reservoir. [0005] Thus, there is a need for accurately consolidating evidence from multiple sources; and, more particularly for resolving conflicts in data collected for new resources. SUMMARY OF THE INVENTION [0006] A feature of the invention is reduced reliance on experienced geologists for resolving conflicts in geological descriptions. [0007] Another feature of the invention is automatic detection of inconsistencies and contradictions in geological contexts. [0008] Yet another feature of the invention is automatic detection of inconsistencies and contradictions in geological contexts, and automatic generation of the level of confidence for consistent matches for reduced reliance on experienced geologists for resolving conflicts in geological descriptions. [0009] The present invention relates to a resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database. BRIEF DESCRIPTION OF THE DRAWINGS [0010] The foregoing and other objects, aspects and advantages will be better understood from the following detailed description of a preferred embodiment of the invention with reference to the drawings, in which:Figure 1 shows an example of a system for exploiting geological resources (e.g., hydrocarbon reservoirs) with newly received geological evidence, according to a preferred embodiment of the present invention; Figure 2 shows an example of updating geological resource descriptions with newly received geological evidence; Figures 3A - B show an example of using keyword pattern matching for identifying relevant knowledge triples for new geological evidence; Figures 4A - B show an example of identifying relevant knowledge triples for new geological evidence using geological relation inferencing; Figure 5 shows an example of determining and updating triple confidence level or trust of a matching set of knowledge triples. DESCRIPTION OF PREFERRED EMBODIMENTS [0011] As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a "circuit," "module" or "system." Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon. [0012] Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. [0013] A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. [0014] Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing. [0015] Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). [0016] Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. [0017] These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks. [0018] The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. [0019] Turning now to the drawings and more particularly, Figure 1 shows an example of a system 100 for exploiting geological resources (e.g., hydrocarbon reservoirs) with newly received geological evidence, according to a preferred embodiment of the present invention. The system 100 includes computers 102, 104 (2 in this example) providing a descriptive matching unit, geological relations inference matching unit and an aggregator, and a connected display 106 for matching, accepting and verifying results (preferably manually). The computers 102, 104 and connected display 106 are coupled, wired or wirelessly to, and communicate with each other over, network 108, e.g., a local area network (LAN), the Internet, an intranet or a combination or hybrid thereof. Typically, the computers 102, 104 include one or more processors, e.g., central processing unit (CPU) 110 and memory 112. The preferred system also includes a geological resource database 114, e.g., a reservoir characterization database, with facts describing geological resources. One or more of the computers 102, 104 may be in communication with sensors 116 located at a geological site 118 being monitored. [0020] With a new description or an updated description of a reservoir to computers 102, 104, the preferred system 100 identifies and displays 106 conflicting geological facts from the descriptions in geological resource database 114. Preferably, the facts are maintained as an n-ary relation such as (subject, predicate, object, location, time). These relationships can be refactored into a group of triples (subjects, predicates and objects). The refactored triples are expressed as related phrases or keywords that provide for a visual or graphic representation on display 106. Thus, although described herein with regard to using keywords, it is intended that reference to "keywords" may also refer to phrases interchangeably without departing from the invention. [0021] Whenever the preferred system 100 identifies or infers that two or more geological knowledge triples are inconsistent, the system 100 displays 106 all inconsistencies for resolution by subject matter experts. The term subject matter experts may be interpreted according to an embodiment by human subject matter experts and according to a further embodiment by a AI (artificial intelligent) machine being trained by a human expert for the resolution of inconsistencies. The experts can review the inconsistent triples and recommend choices for resolution. Optionally, the preferred system 100 can infer results that are more likely to be correct. By matching agreements and confidence levels from pre-established geological, geographical, and temporal hierarchy, e.g., from geological resource database 114. The inferred results facilitate resolution, highlighting any of those results that are more likely to be correct. [0022] So, for example, triples may indicate that Hue Shale was formed during Cretaceous, Triassic and Oligocene periods. Applying text extraction algorithms to the reported knowledge triple(s), the system 100 may infer that Hue Shale was formed during Cretaceous period, and highlight this on the display 106. An expert reviewing the displayed results, may approve the highlighted selection as correct or reject it. Alternatively, the expert may select a different knowledge triple, declaring selected triple as the correct result. Either way, the preferred system 100 records the result in the geological resource database 114, and removes or discards all incorrect triples. [0023] Figure 2 shows an example 120 of updating geological resource descriptions with newly received 122 geological evidence in the preferred system 100 of Figure 1 with like features labeled identically. The system 100 compares the new geological evidence, preferably in the form of triples, against existing geological resource descriptions in a geological resource database 114, as well as existing sources of the descriptions. Those existing sources may include, for example, published geological survey reports, maps, geological event charts, and geological formation diagrams. The descriptive matching unit identifies keyword patterns 124 and the geological relations inference matching unit identifies or infers geological relations 126. Keyword pattern matching 124 and geological relation inference 126 may be done separately or in parallel as shown here. [0024] The system 100 renders the results 124, 126, e.g., on display 106, highlighting any text from the existing sources, e.g., report entries, map locations, event chart periods, or formation layers. Each respective unit finds 128, 130 determines the confidence level of a respective match/inference based on the matched existing descriptions 128 and 130. The matches/inferences are displayed 106 with the respective confidence level for user review, and approval or rejection. The aggregator merges 132 approved matches/inferences into the matched sets in a geological resource database 114. [0025] Upon discovery of a new resource, or upon receipt 122 of new information (e.g., from a discovery or an update) on the resource, the preferred system 100 encodes the information, e.g., as rules that describe basic geological, spatial and temporal knowledge. In particular, the preferred system 100 uses geophysical, lithology, and/or petro-chemical principles to define governing geological relations for coding machine interpretable rules. The petro-chemistry industry and academic publications have existing standard that are universally and consistently used definitions, and defined for petroleum geology. For petro-chemical analysis the preferred system 100 maps these pre-defined nomenclature sets to physical properties of rock types, organic content. Using this industry-wide and academic agreement, the present invention provides for comparing and contrasting geological evidence from multiple sources. [0026] A typical field description or rule(s) in a hydrocarbon database may include several different types of field attribute descriptors that are related to the field by several different types of relational characteristics. The attribute types may include, for example, geographical, time or temporal, geological, petroleum system, rock, hydrocarbon and other (everything else). Relational types may indicate, for example, that the field: contains, overlies, had a depositional environment, formed during, composed of, located at, has the property of, is associated with, and/or has many of, the particular features. [0027] Preferably, from the rules the system 100 represents the resource information in the form of triples. Preferably also, each triple represents two entities or nodes and the relation between the two. For example, a triple may indicate that a geological/rock formation (entity) formed (relation) during a specific temporal interval (associated geological time). The preferred system 100 focuses on petroleum geology to apply background knowledge for improving knowledge extraction from new geological evidence 122. [0028] The system 100 may collect new geological evidence 122 automatically, e.g., using a searchbot or from physical data as it is collected. A searchbot automatically collect new geological evidence 122 online from geological survey publications, conference proceedings, meeting minutes, and proprietary databases, as well. Many petroleum exploration groups maintain and curate free publications and proceedings. Petroleum databases typically have curated structures. Suitable knowledge extraction tools for extracting keywords and phrases include, for example, IBM Watson Knowledge Studio. Physical data collection may be, for example, from location sensors 116, or as provided by personnel checking or monitoring the site 118. [0029] After or during collection, the system 100 represents the new geological evidence 122, preferably, as one or more triples, or in another suitable knowledge format, e.g., W3C Web Ontology Language. Preferably, the triples also include an associated confidence level indication, when available. [0030] Established geological facts exist regarding source rock. For example, Middle Devonian to lower Mississippian epochs are known for widespread marine anoxic oil and gas source beds that are located Mid-Continent (North America) and in Appalachia. Also, upper Jurassic marine mudstone or its stratigraphic equivalents, known as Kimmeridge Clay, generated most of the oil found in the North Sea and the Norwegian Sea. The late Cretaceous Turonian formation, known as La Luna Shale, generated most of the oil in Venezuela. The Marcellus Formation, for example, overlies the Onondaga Formation, is a unit of the Hamilton Group, and formed during the Early Pennsylvanian sub-period. Expressing this as triples: Marcellus Formation, overlies, Onondaga Formation; Marcellus Formation, Unit of, Hamilton Group; and Marcellus Formation, Formed during, Early Pennsylvanian. [0031] A logical inference (or inferred fact) of the above facts, for example, is that the Marcellus Formation (entity) is located (relation) at the Appalachian Basin (geolocation). It is also known that source rock in the Appalachian Basin formed during the Middle Devonian to Lower Mississippian sub-periods. A logical inference of this is that Marcellus Formation formed during the Middle Devonian to Lower Mississippian sub-periods. However, these two logical inferences contradict each other, creating a detectable inconsistency, automatically identified by the preferred system 100. [0032] Figures 3A - B show an example of the descriptive matching unit (e.g., in computers 102, 104 in Figure 1) performing keyword pattern matching 124 to identify relevant knowledge triples 128 from new geological evidence. First, the descriptive matching unit decomposes 124 the new geological evidence into a set of keyword triples 1240 derived from subjects, predicates and objects. Then, the preferred system 100 uses the keyword set from the triples to query 1280 geological resource database entries 1282 for matches 1284. The descriptive matching unit retrieves 1286 matched triples 1284, identifies 1288 any highly relevant sets 1290, e.g., using triple and/or graph similarity matching. Triple matching requires a match of two out three elements. In graph similarity triples linked by subjects or objects match whenever predicate links match above a threshold, e.g., 80%. [0033] For example, a typical new geological triple 1240 may indicate a formation (Marcellus Formation), a relationship term (Formed during) and a corresponding time period (Early Pennsylvanian). In this example, the geological resource database entries 1282 include six (6) triples. The database entries 1282 triples indicate three (3) formations (Marcellus Formation, Onondaga Formation and La Luna Formation), a common relationship term (Formed during) and five (5) corresponding time periods (Early Pennsylvanian, Middle Devonian to Lower Mississippian, Middle Devonian, Mississippian and Cretaceous). [0034] Preferably, the descriptive matching unit uses an n-gram match, for example, to find matches 1284 in the geological resource database entries 1282. An n-gram is a contiguous sequence of n items from a given sequence of text or speech. Thus, the keyword query 1280 identifies matches 1284 with a common formation (Marcellus Formation) and common relationship term (Formed during), and in four (4) time periods (Early Pennsylvanian, Middle Devonian to Lower Mississippian, Middle Devonian and Mississippian). The descriptive matching unit retrieves 1286 matched triples 1284, identifies 1288 highly relevant sets 1290 (an exact match in this example), and returns 1292 that highly relevant match 1290. [0035] Figures 4A - B show an example of the geological relation inferencing unit (e.g., in computers 102, 104 in Figure 1) identifying relevant knowledge triples for new geological evidence using geological relation inferencing 126. The descriptive matching unit decomposes 126 the new triple 1260 derived from geological subjects, predicates and objects as a set of keywords. Then, the inferencing unit traverses 1300 geological resource database entries 1302 for predefined geological relations 1304 with matching predicates, which may result in an expanded set of keywords 1306. The inferencing unit applies 1308 the expanded set to the database entries 1302 to identify matches 1310. The inferencing unit retrieves 1312 matched triples 1312, identifies highly relevant sets, e.g., using triple and/or graph similarity matching, and returns 1314 the highly relevant sets 1312. [0036] In this example, a typical new geological triple 1260 may indicate a formation (Marcellus Formation), a relationship term (Unit of, and Located at) and a corresponding location (Hamilton Group and Appalachian Basin). Also in this example, the geological resource database entries 1302 include three (3) triples. The database triples indicate two (2) formations (Marcellus Formation, and La Luna Formation), a common relationship term (Located at) and three (3) corresponding locations (Appalachian Basin, eastern North America and Catatumbo). [0037] The geological relations 1304 include three (3) formation groups (Appalachian Basin, Hamilton Group, and Catatumbo Marcellus Formation), a common relationship term (Located at), and two (2) locations (eastern North America and Columbia). The preferred system 100 retrieves 1312 matched triples 1310, identifies two (2, both in this example) highly relevant sets with a common formation (Marcellus Formation) and relationship term (Located at), and at two (2) locations (eastern North America and Columbia). The preferred system 100 returns 1314 these 2 highly relevant matches 1310. [0038] Figure 5 shows an example of the aggregator (e.g., in computers 102, 104 in Figure 1) determining and updating 132 the confidence level or trust of a matching triples or sets of knowledge triples, 1290, 1310 in Figures 3A - B and 4A - B. The aggregator may track all mismatches or incorrect matches, and optionally track all correct matches, coupled with corresponding originating information sources. Suitable statistical measures of confidence level include, for example, majority vote, weighted authoritative sources, and machine learned likelihood statistics. Originating information sources may include a combination of unstructured text corpus and knowledge extraction tools applied to the corpus. Tracking history may be maintained in the geological resource database 114. [0039] Using geological, geographical and temporal inferencing rules, the system 100 identifies 1320 consistent and conflicting knowledge triples. Then, the aggregator aggregates 1322 triples, consistent and conflicting, by confidence level using, for example, an average or a majority vote. The aggregates ranks 1324 the aggregated confidence level for consistent triples against the confidence level associated with any conflicts. Thus, for sources used in ranking 1324 the confidence scores may be determined from comparing the number of consistent triples against the number of inconsistent triples. Viewing the scores associated with knowledge sources, e.g., on display 106, an expert can decide whether to include or exclude each result from future knowledge ingestion, and can selectively remove any knowledge triples from excluded sources. [0040] For example, taking into account geological context, the Meeteetse Formation is known to have a thermal maturity (ThermalMaturity) of over mature (overmature) for the current time period, i.e., today. During the Cretaceous period (time period) the Meeteetse Formation is known to have a mature (mature) thermal maturity (ThermalMaturity). These triples are consistent and may be aggregated. [0041] If the aggregated confidence level ranks 1326 greater than the conflicting evidence, then the aggregator merges 1328 new knowledge triple(s) 1290, 1310 with the knowledge database. Otherwise, the match is left open for further interpretation, wherein the system 100 displays 106 the conflict and issues automated request 1330 for clarification. Experts (e.g., geoscientists or AI machines) may answer the automated request 1330. Alternately, the automated request 1328 may trigger additional corpus acquisition and ingestion 1332. Such, acquisition and ingestion 1332 may focus on the resource area or location for additional evidence to reach a resolution. After merging 1328, the preferred system 100 updates 1334 the confidence level of the matched sets. [0042] Thus advantageously, the preferred system detects inconsistencies and contradictions in geological contexts represented as triples or in a similar knowledge representation. The received representations may be matched by keyword pattern matching or inferred relations defined in geological rules to detect conflicts, e.g., from inconsistencies. Consistent representations may be appended and merged with existing representations triples in a knowledge database or knowledge store. Consistent and inconsistent representations may be maintained in the store serving as aggregation point of the geological findings. The merged set is updated with a new confidence score determined using a suitable confidence measure. The confidence score provides a system generated confidence measure about stored knowledge on the area. [0043] While the invention has been described in terms of preferred embodiments, those skilled in the art will recognize that the invention can be practiced with modification within the scope of the appended claims. It is intended that all such variations and modifications fall within the scope of the appended claims. Examples and drawings are, accordingly, to be regarded as illustrative rather than restrictive.
权利要求:
Claims (20) [0001] A resource exploitation management system comprising: a resource description storage storing representations of known resources; a descriptive matching unit configured to match keywords in new resource descriptions against keywords in stored said representations; a geological relations inference matching unit configured to match relations geologically inferred from new resource descriptions against for predefined geological relations; an aggregator configured to aggregate consistent and inconsistent matched representations; and matching means configured for providing inconsistent aggregated matches for clarification. [0002] A resource exploitation management system as in claim 1, wherein said new resource descriptions comprise evidence and conjectures triples regarding a respective resource in a generic knowledge representation. [0003] A resource exploitation management system as in claim 1 or 2, wherein said descriptive matching unit is configured to compare said evidence and conjecture triples with keywords in said representations to identify matches with said known resources, and is configured to identify any matching triples with a confidence level above a threshold as highly likely. [0004] A resource exploitation management system as in claim 2 and any of previous claims, wherein said generic knowledge representation comprises a plurality of evidence and conjecture triples and said geological relations inference matching unit is configured to compare said evidence and conjecture triples with geologically inferred relations in said representations to identify matches with said known resources, and is configured to identify any matching triples with a confidence level above a threshold as highly likely. [0005] A resource exploitation management system as in claim 2 and any of previous claims, wherein said generic knowledge representation comprises a plurality of evidence and conjecture triples and said aggregator comprises: means for determining a confidence level for each matched triple; means for identifying said each matched triple as a consistent or conflicting match; means for aggregating triple confidence level, consistent and conflicting confidence level being aggregated for said each matched triple; and means for ranking matched triples by aggregated consistent confidence level against aggregated conflicting confidence level. [0006] A resource exploitation management system as in claim 5 and any of claims 1 to 4, wherein whenever said aggregated consistent confidence level is greater than said aggregated conflicting confidence level for a ranked matched triple, said aggregator merges said ranked matched triple with said resource description. [0007] A resource exploitation management system as in claim 5 and any of previous claims, wherein said aggregator compares the number of consistent triples against the number of inconsistent triples, the difference indicating a confidence scores, and said manual matching means includes a display, said display displaying said knowledge scores associated with knowledge sources, such that it can be decided whether to include or exclude each result from future knowledge ingestion, and it can be selectively removed any knowledge triples from excluded sources. [0008] A method of exploiting geological resources, said method comprising: receiving a corresponding text with description of new geological evidence for a geological resource; matching keywords in said description against keywords associated to representations in a geological resource database; inferring geological relations from said descriptions; matching said inferred geological relations against predefined geological relations from said geological resource database; merging consistent matches with said geological resource database; and updating a confidence level in said geological resource database for merged matches. [0009] A method as in claim 8, wherein said description comprises one or more triples describing said new geological evidence. [0010] A method as in claim 9, wherein matching keywords comprises: decomposing said one or more triples into a plurality of geological formation keywords; querying said geological resource database with said plurality of geological formation keywords for triples with keyword matches; retrieving said triples with keyword matches from said geological resource database; identifying highly relevant said retrieved triples; and returning identified highly relevant geological formation triples as keyword matches. [0011] A method as in claim 10, wherein said plurality of geological formation keywords are decomposed from geological resource subjects, predicates and objects, and highly relevant triples are determined by graph similarity matching to identify highly relevant sets. [0012] A method as in claim 9 and any of claims 8 to 11, wherein inferring geological relations comprises: decomposing said one or more triples into a plurality of geological geographically related keywords; querying said geological resource database with said plurality of geological geographically related keywords for matching geographical relationships; selectively expanding said plurality of geological geographically related keywords responsive to query matches; matching the geological geographically related keywords with triple in the geological resource database entries; retrieving matching said triples from said geological resource database; identifying highly relevant said retrieved triples; and returning identified highly relevant triples as geologically inferred matches. [0013] A method as in claim 12, wherein said plurality of geological geographically related keywords are decomposed from geological geographically relation subjects, predicates and objects, and highly relevant triples are determined by graph similarity matching to identify highly relevant sets. [0014] A method as in claim 9 and any of claims 8 to 13, wherein merging consistent matches comprises: identifying said keyword matches and said inferred geological relations matches as consistent and conflicting; aggregating consistent and conflicting matches; identifying as consistent matches any triples with aggregated consistent matches exceeding conflicting matches; and merging said identified with said matched geological resource database entries. [0015] A method as in claim 14 and any of claims 8 to 13, wherein whenever the aggregated consistent matches do not exceed conflicting matches, merging consistent matches further comprises displaying a clarification request. [0016] A method as in claim 14 and any of claims 8 to 15, wherein whenever the aggregated consistent matches do not exceeding conflicting match, merging consistent matches further comprises triggering additional corpus acquisition and ingestion focused on resource area or location. [0017] A computer program product for exploiting geological resources, said computer program product comprising a non-transitory computer usable medium having computer readable program code stored thereon, said computer readable program code causing one or more computers executing said code to: receive a corresponding text with description of new geological evidence for a geological resource, said description comprising one or more triples describing said new geological evidence; match keywords in said description against keywords associated to representations in a geological resource database; infer geological relations from said descriptions; match said inferred geological relations against predefined geological relations from said geological resource database; merge consistent matches with said geological resource database; and update a confidence level in said geological resource database for merged matches. [0018] A computer program product for exploiting geological resources as in claim 17, said wherein computer readable program code matching keywords causes said one or more computers executing said code to: decompose said one or more triples into a plurality of geological formation keywords; query said geological resource database with said plurality of geological formation keywords for triples with keyword matches; retrieve said triples with keyword matches from said geological resource database; identify highly relevant said retrieved triples; and return identified highly relevant geological formation triples as keyword matches. [0019] A computer program product for exploiting geological resources as in any of claims 17 to 18, said wherein computer readable program code inferring geological relations causes said one or more computers executing said code to: decompose said one or more triples into a plurality of geological geographically related keywords; query said geological resource database with said plurality of geological geographically related keywords for matching geographical relationships; selectively expand said plurality of geological geographically related keywords responsive to query matches; match the geological geographically related keywords with triple in the geological resource database entries; retrieve matching said triples from said geological resource database; identify highly relevant said retrieved triples; and return identified highly relevant triples as geologically inferred matches. [0020] A computer program product for exploiting geological resources as in any of claims 17 to 19, said wherein computer readable program code merging consistent matches causes said one or more computers executing said code to: identify said keyword matches and said inferred geological relations matches as consistent and conflicting; aggregate consistent and conflicting matches; identify as consistent matches any triples with aggregated consistent matches exceeding conflicting matches; and merge identified said with matched said geological resource database entries; and whenever the aggregated consistent matches do not exceed conflicting matches display a clarification request.
类似技术:
公开号 | 公开日 | 专利标题 Gao et al.2017|Constructing gazetteers from volunteered big geo-data based on Hadoop US10460235B1|2019-10-29|Data model generation using generative adversarial networks US10102220B2|2018-10-16|Activity based analytics US10599732B2|2020-03-24|Methods and systems for discovery of linkage points between data sources Jurgens et al.2015|Geolocation prediction in twitter using social networks: A critical analysis and review of current practice Li et al.2014|Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation US20200210610A1|2020-07-02|Differentially Private Processing and Database Storage EP2973041B1|2018-08-01|Apparatus, systems, and methods for batch and realtime data processing CA2916762C|2016-07-26|Control variable determination to maximize a drilling rate of penetration Pujara et al.2013|Knowledge graph identification US10664757B2|2020-05-26|Cognitive operations based on empirically constructed knowledge graphs US10296658B2|2019-05-21|Use of context-dependent statistics to suggest next steps while exploring a dataset Kocaguneli et al.2013|Software effort models should be assessed via leave-one-out validation CN104737166B|2018-09-18|Data lineage system US20200012968A1|2020-01-09|Classifying user behavior as anomalous Baddeley et al.2005|Residual analysis for spatial point processes | US20190162868A1|2019-05-30|Multi-Scale Deep Network for Fault Detection KR101691243B1|2016-12-29|Merging search results US7930262B2|2011-04-19|System and method for the longitudinal analysis of education outcomes using cohort life cycles, cluster analytics-based cohort analysis, and probabilistic data schemas CN102567464B|2015-08-05|Based on the knowledge resource method for organizing of expansion thematic map Holdaway2014|Harness oil and gas big data with analytics: Optimize exploration and production with data-driven models Bhattacharya et al.2018|Applications of machine learning for facies and fracture prediction using Bayesian Network Theory and Random Forest: Case studies from the Appalachian basin, USA US6430547B1|2002-08-06|Method and system for integrating spatial analysis and data mining analysis to ascertain relationships between collected samples and geology with remotely sensed data AU2007211291B2|2012-03-22|Methods, systems, and computer-readable media for fast updating of oil and gas field production models with physical and proxy simulators US6370547B1|2002-04-09|Database correlation method
同族专利:
公开号 | 公开日 US20200233851A1|2020-07-23|
引用文献:
公开号 | 申请日 | 公开日 | 申请人 | 专利标题
法律状态:
2020-06-19| STAA| Information on the status of an ep patent application or granted ep patent|Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED | 2020-06-19| PUAI| Public reference made under article 153(3) epc to a published international application that has entered the european phase|Free format text: ORIGINAL CODE: 0009012 | 2020-07-22| AK| Designated contracting states|Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR | 2020-07-22| AX| Request for extension of the european patent|Extension state: BA ME | 2020-12-16| RIN1| Information on inventor provided before grant (corrected)|Inventor name: DE PAOLA, GIORGIO Inventor name: IBANEZ-LLANO, CRISTINA Inventor name: EMBID DROZ, SONIA MARIETTE Inventor name: CHANG, YUAN-CHI Inventor name: FARRELL, ROBERT Inventor name: RODRIGUEZ TORRADO, RUBEN Inventor name: TREWIN, SHARON M. Inventor name: MURAKAMI, AKIKO Inventor name: CANIM, MUSTAFA | 2021-05-28| STAA| Information on the status of an ep patent application or granted ep patent|Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN | 2021-06-30| 18D| Application deemed to be withdrawn|Effective date: 20210123 |
优先权:
[返回顶部]
申请号 | 申请日 | 专利标题 相关专利
Sulfonates, polymers, resist compositions and patterning process
Washing machine
Washing machine
Device for fixture finishing and tension adjusting of membrane
Structure for Equipping Band in a Plane Cathode Ray Tube
Process for preparation of 7 alpha-carboxyl 9, 11-epoxy steroids and intermediates useful therein an
国家/地区
|